Sub-Space Missing Feature Imputation and Environment Sniffing for Robust Speech Recognition

نویسنده

  • Yuan-Fu Liao
چکیده

Noise robustness is the most important issue for real-life speech recognition/spoken dialogue system. In this paper, a sub-space missing feature theory (S-MFT) front-end is proposed to alleviate the corruption of the background noise. SMFT incorporates temporal information and applies principle component analysis (PCA) to find a noise-suppressed subspace for more precise missing feature imputation. Moreover, two parameters including signal-to-noise ratio (SNR) and divergence-based reliability measure of an input utterance generated by S-MFT are employed to judge whether the utterance can be successfully recognized or not. Experiments on TIDIGIT and three kinds of noises from NOISEX-92 corpora have verified the benefits of the proposed S-MFT approach. The average recognition rate could be improved from 61.6% (cepstral mean and variance normalization, CN) to 77.1% and further to 89.9% while rejecting bad utterances.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

روشی جدید در بازشناسی مقاوم گفتار مبتنی بر دادگان مفقود با استفاده از شبکه عصبی دوسویه

Performance of speech recognition systems is greatly reduced when speech corrupted by noise. One common method for robust speech recognition systems is missing feature methods. In this way, the components in time - frequency representation of signal (Spectrogram) that present low signal to noise ratio (SNR), are tagged as missing and deleted then replaced by remained components and statistical ...

متن کامل

Missing Feature Imputation of Log-spectral Data for Noise Robust Asr

In this paper, we present a missing feature (MF) imputation algorithm for log-spectral data with applications to noise robust ASR. Drawing from previous work [1], we adapt the previously proposed spectrographic reconstruction solution to the liftered log-spectral domain by introducing log-spectral flooring (LS-FLR). LS-FLR is shown to be an efficient and effective noise robust feature extractio...

متن کامل

Recognition of reverberant speech by missing data imputation and NMF feature enhancement

The problem of reverberation in speech recognition is addressed in this study by extending a noise-robust feature enhancement method based on non-negative matrix factorization. The signal model of the observation as a linear combination of sample spectrograms is augmented by a melspectral feature domain convolution to account for the effects of room reverberation. The proposed method is contras...

متن کامل

Mask estimation in non-stationary noise environments for missing feature based robust speech recognition

In missing feature based automatic speech recognition (ASR), the role of the spectro-temporal mask in providing an accurate description of the relationship between target speech and environmental noise is critical for minimizing the degradation in ASR word accuracy (WAC) as the signal-to-noise ratio (SNR) decreases. This paper demonstrates the importance of accurate characterization of instanta...

متن کامل

Coupling identification and reconstruction of missing features for noise-robust automatic speech recognition

The standard missing feature imputation approach to noiserobust automatic speech recognition requires that a single foreground/background segmentation mask is identified prior to reconstruction. This paper presents a novel imputation approach which more closely couples the identification and reconstruction of missing features by using a probabilistic framework based on the speech fragment decod...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2005